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EXPLICIT CRITERIA FOR SEVERAL TYPES OF ERGODICITY 
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Abstract. The explicit criteria, collected in Tables 5.1 and 5.2, for several types of ergodicity of one- 
dimensional diffusions or birth-death processes have been found out recently in a surprisingly short period. 
One of the criteria is for exponential ergodicity of birth-death processes. This problem has been opened 
for a long time in the study of Markov chains. The survey article explains in details the idea which leads 
to solve the problem just mentioned. It is interesting that the problem is connected with several branches 
of mathematics. Some open problems for the further study are also proposed. 



Let us begin with the paper by recalling the three traditional types of ergodicity. 

1. Three traditional types of ergodicity. Let Q = (gy) be a regular Q-matrix on a countable 
set E — {i,j, k, ■ ■ ■ }. That is, q^j > for all i ^ j, q^ := — q^ = Qij < 00 f° r all i G 25 and Q 

determines uniquely a transition probability matrix P(t) = (Pij{t)) (which is also called a Q-process 
or a Markov chain). Denote by tt — (7Tj) a stationary distribution of P(i): 7rP(t) = it for all t > 0. 
From now on, assume that the Q-matrix is irreducible and hence the stationary distribution n is 
unique. Then, the three types of ergodicity are defined respectively as follows. 

Ordinary ergodicity : lim \pij(t) — vr, | =0 (!•!) 

t — >oc 

Exponential ergodicity : lim e at \pij(t) — =0 (1-2) 

t — >oo 

Strong ergodicity : lim sup \pij (t) — 7r,- 1 =0 

t — >oc ^ 

lim e^ sup Ipyft) - nA = 0, (1.3) 

t— »OC i 

where a and (3 are (the largest) positive constants and i,j varies over whole E. The definitions 
are meaningful for general Markov processes once the pointwise convergence is replaced by the 
convergence in total variation norm. The three types of ergodicity were studied in a great deal 
during 1953-1981. Especially, it was proved that strong ergodicity =^> exponential ergodicity =$> 
ordinary ergodicity. Refer to Anderson (1991), Chen (1992, Chapter 4) and Meyn and Tweedie 
(1993) for details and related references. The study is quite complete in the sense that we have the 
following criteria which are described by the Q-matrix plus a test sequence (j/j) only. 

Theorem 1.1 (Criteria). Let H ^ be an arbitrary but fixed finite subset of E. Then the following 
conclusions hold. 

(1) The process P(t) is ergodic iff the system of inequalities 
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has a nonnegative finite solution (yi). 
(2) The process P(t) is exponentially ergodic iff for some A > with A < q i for all i, the system 
of inequalities 



has a nonnegative finite solution (yi). 
(3) The process P(t) is strongly ergodic iff the system (1-4) of inequalities has a bounded nonneg- 
ative solution (yi). 

The probabilistic meaning of the criteria reads respectively as follows: 



where a H = mf{t > the first jumping time : X t G H} and A is the same as in (1.5). The criteria are 
not completely explicit since they depend on the test sequences (y,) and in general it is often non- 
trivial to solve a system of infinite inequalities. Hence, one expects to find out some explicit criteria 
for some specific processes. Clearly, for this, the first choice should be the birth-death processes. 
Recall that for a birth-death process with state space E = Z + = {0, 1, 2, • • • }, its Q-matrix has the 
form: qi^+i = hi > for all i > 0, q%,i-i — at > for alH > 1 and q^ = for all other i ^ j. Along 
this line, it was proved by Tweedie (1981)(see also Anderson (1991) or Chen (1992)) that 



where no = 1 and \i n = bo ■ • ■ b n -i/a\ ■ ■ ■ a n for all n > 1. Refer to Wang (1980), Yang (1986) or Hou 
et al (2000) for the probabilistic meaning of S. However, the condition is not necessary. A simple 
example is as follows. Let a, = b% = i 1 (i > 1) and bo = 1. Then the process is exponential ergodic 
iff 7 > 2 (see Chen (1996)) but S < oo iff 7 > 2. Surprisingly, the condition is correct for strong 
crgodicity. 

Theorem 1.2 (H. J. Zhang, Lin and Hou (1998)). S < 00 Strong ergodicity. 

With a different proof, the result is extended by Y. H. Zhang (2001) to the single-birth processes 
with state space Z + . Here, the term "single birth" means that q^+i > for alH > but q^ > can 
be arbitrary for j < i. Introducing this class of Q-processes is due to the following observation: If 
the first inequality in (1.4) is replaced by equality, then we get a recursion formula for (y^) with one 
parameter only. Hence, there should exist an explicit criterion for the ergodicity (resp. uniqueness, 
recurrence and strong ergodicity). For (1.5), there is also a recursion formula but now two parameters 
arc involved and so it is unclear whether there exists an explicit criterion or not for the exponential 
crgodicity. 

Note that the criteria are not enough to estimate the convergence rate a or (3 (cf. Chen (2000a)). 
It is the main reason why we have to come back to study the well-developed theory of Markov chains. 
For birth-death processes, the estimation of a was studied by Doom in a book (1981) and in a series 
of papers (1985, 1987, 1991). Especially, the precise a was computed out for four particular models. 
This work has been very helpful in the later study, as will be discussed in the next section. However, 
there is no explicit criterion for a > ever appeared so far. The difficulty mainly comes from the 
complex of two parameters and as we will see in the next section, there is indeed an intrinsic reason 
for it. By the way, we point out that there is nearly no result about the estimation of f3. 

Open problem 1. How to estimate f3? 

2. The first (non-trivial) eigenvalue (spectral gap). The birth-death processes have a nice 
property — symmetrizability: ^iPij(t) — [ijPji(t) for all i,j and t > 0. Then, the matrix Q can be 
regarded as a self-adjoint operator on the real L 2 -space L 2 (p) with norm || • ||. In other words, one 
can use the L 2 -theory, which was a starting point of Doom (1981). Furthermore, one can study the 




(1.5) 



maxEifTrr < 00, maxE^e 



< 00 and supEi(7 H < 00, 

i£E 
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L 2 -exponential convergence given below. Assuming that \i := ^ Hi < 00 an d then setting 7Tj = Hi/l-i, 
we have L 2 -space L 2 (w) with norm || • ||. Then, the convergence means that 

||P(t)/-7r(/)||<||/-7r(/)||<e-^ (1.6) 

for all t > 0, where 7r(/) = / fdir and Ai is the first non-trivial eigenvalue of (-Q) (cf. Chen (1992, 
Chapter 9)). 

The estimation of Ai for birth-death processes was studied by Sullivan (1984), Liggett (1989) 
and Landim, Scthuraman and Varadhan (1996) (see also Kipnis & Lamdin (1999)). It was used as 
a comparison tool to handle the convergence rate for some interacting particle systems, which are 
infinite-dimensional Markov processes. 

The present author came to this topic by comparing Ai with a, which is the first result in (2.1). 
This transfers all known results about a to Ai. Then, by using the coupling methods, the author 
obtained a variational formula given in the second line of (2.1). 

Theorem 2.1. 



a = Ai [C7ien(1991)] 



= sup inf Ii(w) 

»6W i>° 



[C7ien(1996)] 



(2-1) 



where W = {w : Wi T1\ n(w) > 0} and Ii(w) 



I 



Liibi(w i+ i - Wi) 



£ 

j>i+i 



In view of Theorem 2.1, in order to obtain a criterion for the exponential ergodicity, one needs 
only to have a representative sequence (wi). Note that the test sequence (w^ used in the formula is 
indeed a mimic of the eigenvector of Ai, and in general, the eigenvector can be very sensitive. Thus, 
it becomes a question from the above formula about the existence of a representative sequence (u>j) 
for justifying the positiveness of a. Let us now leave Markov chains for a while and turn to diffusions. 

3. One-dimensional diffusions. As a parallel of birth-death process, we now consider the elliptic 
operator L = o(x)d 2 /dx 2 + b(x)d/dx on the half line [0, 00) with a(x) > everywhere. Again, we 
are interested in estimation of the principle eigenvalues, which consist of the typical, well-known 
Sturm-Liouvillc eigenvalue problem. Refer to Egorov & Kondratiev (1996) for the present status of 
the study and references. Here, we mention two results, which are the most general ones we have 
ever known before. 

Theorem 3.1. Let b(x) = (which corresponds to the birth-death process with ai = bi for all i >1) 
and set S = svip x>0 x a -1 . Here we omit the integration variable when it is integrated with respect 
to the Lebesgue measure. Then, we have 

(1) Kac & Krein (1958): S^ 1 > A > (4<5)~ 1 , here A is the first eigenvalue corresponding to the 
Dirichlet boundary /(0) = 0. 

(2) Kotani & Watanabe (1982): (T 1 > Ai > (45)" 1 . 

In the diffusion context, we also obtained a variational formula. 
Theorem 3.2. (Chen & Wang (1997a), Chen(2000b)). 



Ai 



sup inf 



-C(x) 



fix) 



f(u)e 



C(u) 



a(u) 



-du 



(3.1) 



where 



f x 1 e c(a:) 
C(x) = b/a, irtdx) = r^r^x (Z is the normalizing constant) and 

Jo z a \ x ) 



Z a(x) 

T={feL 1 (ir) : tt(/) > and /'| (0 ,oo) > 0}. 
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In the papers quoted above, it was proved that the equality in (3.1) replaced by ">" holds in 
general and the equality holds whenever both a and b are continuous. Clearly the equality should 
also hold for certain measurable a and b, in virtue of a standard approximating procedure and (3.2) 
below. Again, it is unclear from the result whether there exists a representative function / e T or 
not to justify the positiveness of Ao or Ai. 

Before moving further, let us recall the classical variational formula for Ai: 

Ai = inf{D(/) : tt(/) = 0, n(f) = 1}, (3.2) 

p OO 

7r(da;) = e c(x) dx/(a(x)Z), D{f) = / a/' 2 d7r. 

Jo 

Note that there is no common point between (3.1) and (3.2). The formula (3.1) (resp. (3.2)) is 
especially meaningful in estimating the lower (resp. upper) bounds of Ai. Now, it is simple matter 
to rewrite (3.2) as (3.3) below. Similarly, we have (3.4) for Ao- 

Poincare inequalities. 

Ai: var(/) < X^D(f) (3.3) 
Ao: Il/H 2 < Ao 1 ^/), /(0) = 0. (3.4) 

It is interesting that inequality (3.4) is a special but typical case of the weighted Hardy inequality 
discussed in the next section. 

4. Weighted Hardy inequality. The classical Hardy inequality goes back to Hardy (1920): 

where the optimal constant was determined by Landau (1926). After a long period of efforts by 
analysts, the inequality was finally extended to the following form. 

Weighted Hardy inequality (Muckenhoupt (1972)). Let v and A be nonnegative Borel mea- 
sures. Then 

/ f 2 dv<A /' 2 dA, /eC\/(0) = 0, (4.1) 
Jo Jo 

where B = sup x>0 v[x, oo] /^(dAabs/dLcb) -1 satisfies B < A < 4£> and dA a bs/dLeb is the derivative 
of the absolutely continuous part of A with respect to the Lebesgue measure. 

The Hardy-type inequalities play a very important role in the study of harmonic analysis and 
have been treated in many publications. Refer to the books: Opic & Kufner (1990), Dynkin (1990), 
Mazya (1985) and the survey article Davies (1999) for more details. The author learnt the weighted 
inequality from Miclo (1999a), Bobkov & Gotze (1999a, b). 

It is now easy to deduce the Poincare inequality (3.4) from (4.1), simply setting v = it and 
A = e c dx. In other words, Muckenhoupt 's Theorem Kac & Krcin's Theorem. 

Thanks are given to the weighted Hardy inequality, from which we learnt that there must exist 
a representative test function. Then, it is not difficult to figure out that the function is simply 

Theorem 4.1 (Chen (2000b, 2001a)). Let S = sup x>0 f* e-° e c /a. Then, we have 

(1) 6' > A > (4<5)-\ where S' is an explicit constant satisfying 5 < 5' < 25. Moreover, Ao 
(resp. AiJ> iff 5 < oo. 

(2) Ao-sup influx*)- 1 

}e jrx>0 

= inf sup/(/)(x) _1 (Completed variational formula!). 

f£F' x>0 
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Here, the set T' is a modification of T (The readers are urged to find out the details from the 
original papers). The lower bound in part (1) is an application of (3.1) to the representative function 
mentioned above. The upper bound comes from a direct computation. Part (1) improves (4.1) in 
the present situation. Clearly, part (2) is much finer result than (4.1) and can not be deduced from 
(4.1). The second line in part (2) is a natural dual of the first one but is completely different from 
the classical one (3.4). 

The result can be immediately applied to the whole line or higher-dimensional situation. For 
instance, for Laplacian on compact Riemannian manifolds, it was proved by Chen & Wang (1997b) 
that 

Ai>sup inf /(/) (r) - 1 =: 6 
fe jr re(o,r>) 

for some /(/), which is similar to the right-hand side of (3.1). We now have 5' > 6 > (4<5) _1 for 
some constants S and 5' similar to Theorem 4.1. Refer to Chen (2000b) for details. 

We now return to birth-death processes. A parallel result of Theorem 4.1 has been presented 
in Chen (2000b). In particular, we obtain a criterion for the positiveness of Ai for birth-death 
processes. This is also done by Miclo (1999b) in terms of discrete Hardy inequalities. Combining 
this with Theorem 2.1, we finally obtain a criterion for the exponential crgodicity of birth-death 
processes. The result is included in Table 5.1 below. We mention that it is now possible to prove the 
last criterion directly from (1.5), based on the study on Ai. However, the more general case remains 
open. 

Open problem 2. Does there exist a criterion for exponential ergodicity of single-birth processes? 

5. Three basic inequalities. Denote by var(f)= ||/ — 7r(/)|| 2 = 7r(/ 2 )—7r(/) 2 . Then, the inequality 
(3.3) can be rewritten as (5.1) below. On the other hand, one may study other inequalities, for 
instance, the logarithmic or Nash inequality listed below. 

Poincare inequality : var(/) < \^ 1 D(f) (5.1) 
LogS inequality : J f log(|/|/||/||)d7r < o-~ 1 D{f) (5.2) 
Nash inequality : v&v(f) 1+2/v < r]' 1 D{f)\\f\\\ /v (for some v > 0). (5.3) 



Here, to save notation, a (resp. 77) denotes the largest constant so that (5.2) (resp. (5.3)) holds. 

Each inequality describes a type of ergodicity. First, (5.1) ^==^ (1.6). Next, the logarithmic 
Sobolev inequality implies the decay of the semigroup P(t) to ir exponentially in relative entropy 
with rate a and the Nash inequality is equivalent to var(P(t)(/)) < C\\f\\i/t v / 2 . For reversible 
Markov chains, a complete diagram of these types of ergodicity was presented in Chen (1999, 2001b). 

Fortunately, the criteria (also based on the weighted Hardy inequality) for the last two inequalities 
as well as for the discrete spectrum (which means that there is no continuous spectrum and moreover, 
all eigenvalues have finite multiplicity) are obtained by Mao (2000a, b,c). We can now summarize the 
results in Table 5.1. The table is arranged in such order that the property in the latter line is stranger 
than the former one, the only exception is that even though the strong ergodicity is often stronger 
than the logarithmic Sobolev inequality but they are not comparable in general (Chen (2001b)). 



BlRTH-DEATH PROCESSES 



i + 1 at rate bi = qi^+i > 
i — 1 at rate a, = q% t i-i > 0. 



Define 



/i = 1, [i n = — n>l; n[i,k]= fij. 



i<j<k 
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Property 


Criterion 


Uniqueness 


„ 1 

n>0 Mn0n 


Recurrence 


/ , — OO 

n>0 M " " 


Ergodicity 


(*) & n[0, 00) < 00 


Exponential ergodicity 


(*) & sup^i[n, 00) — — < 00 

n>l 1 

— 3< n — 1 


Discrete spectrum 


fe-1 

(*) & lim sup /u,[k, 00) > — — = 


LogS inequality 


(*) & sup^i[n, oo)log[yu[n, oo) _1 ] — ;~ <oc 

n>l " ,/'.-''.; 


Strong ergodicity 


(*) & XI — r/ i [ n + 1 ' °°)=y"^™X ~^r <00 

n>V n>l 3 <n-V J 3 


Nash inequality 


(*) & sup/i^oo)^" 2 ^ V 1 <oo(e) 



Table 5.1 

Here, "(*) & • • • " means that one requires the uniqueness condition in the first line plus the condition 
"• • •". The "(e)" in the last line means that there is still a small gap from being necessary. 

Open problem 3. Find out a criterion for the Nash inequality in dimension one. 

In view of Theorem 4.1, it is natural to ask the following 

Open question 4. Does there exist a variational formula in the form of part (2) of Theorem 4.1 
for the logarithmic Sobolev or Nash inequality? 

Diffusion processes on [0, oo) 



Define 



C(x) = [ b/a, ti[x,y}= [ e c /a. 
Jo Jx 



Property 


Criterion 


Uniqueness 


/ n[0,x]e- c W = 00 (*) 
<k 


Recurrence 


/ e- c W=oo 
Jo 


Ergodicity 


(*) & fx[0, oo) < oo 


Poincare inequality 


(*) & sup /j,[x, oo) / e~ c < oo 
x>0 Jo 


Discrete spectrum 


(*) & lim sup/x[x, oo) / e~ c = 


LogS inequality 


(*) & sup oo)log[/u[a;, oo) -1 ] / e~ c < oo 

x>0 ' Jo 


Strong ergodicity 


/■(XJ 

(*) & / M[^°oK C(a:) <oo? 
Jo 


Nash inequality 


(*) & sup fi[x, oof v - 2 V v { e-° < oo(e) 

x>0 Jo 
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Table 5.2 

For Nash inequality, we have the same remark as before. Note that the exponential ergodicity in 
Table 5.1 is replaced by Poincare inequality here. This suggests the following problem which is 
conjectured to be true (partially solved in Chen (1998, 2000a). Here we mention a simple fact 
which was missed in the original papers. In the reversible case, for p a (x,y) := dP s (x, -)/d\, we have 
J{Ps{x,y)/n(y)) 2 Tr(dy) = J p s {x, y)p s {y, x)X(dy)/n(x) = p 2s (x, x)/ir(x) < oo. Hence p s (x, -)/ir e 
L 2 (n)). 

Open problem 5. Prove that the exponentially ergodic convergence rate coincides with the L 2 - 
exponential convergence rate, or at least these two convergences are equivalent. 

The question marked in the line of strong ergodicity means that it remains unproved, but con- 
jectured to be true in terms of the similarity between one-dimensional diffusions and birth-death 
processes. 

Open problem 6. Prove the criterion for strong ergodicity listed in Table 5.2. 

Added in Proof. A large part of the open problems, except 3 and 4, have been solved by Yong-Hua 
Mao and Yu-Hui Zhang in the past few months. 
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